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Abstract 

We initiate a study of the homomorphism domination exponent of a pair of graphs F and 
G, defined as the maximum real number c such that |Hom(F, r)| ^ |Hom(G, T)|'^ for every 
graph T. The problem of determining whether HDE(f , G) ^ 1 is known as the homomorphism 
domination problem and its decidability is an important open question arising in the theory 
of relational databases. We investigate the combinatorial and computational properties of the 
homomorphism domination exponent, proving upper and lower bounds and isolating classes of 
graphs F and G for which HDE(F, G) is computable. In particular, we present a linear program 
computing HDE(F, G) in the special case where F is chordal and G is series-parallel. 
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1 Introduction 

A well known corollary of the Kruskal-Katona theorem states that a graph with e edges can have 
at most e^/^ triangles. More generally one may ask: given two graphs F and G, if we know that a 
third graph T has a copies of as a subgraph, what can we say about the number of copies of G 
in T? This paper is an attempt to pursue a systematic study of a general question of this type. 

For (directed) graphs F and G, a homomorphism from F to G is a function ip from the vertices 
of F to the vertices of G such that for any edge {u,v) of F, the pair {lp{u), (p{v)) is an edge of 
G. The set of all homomorphisms from F to G is denoted Hom(F, G), its cardinality is denoted 
hom(F, G), and we write F G if hom(F, G) ^ 1. 

Given a graph T, one can consider the profile of its "subgraph counts" given by the numbers 
hom(F, T), as F varies over all finite graphs. The set of all possible profiles encodes much in- 
formation about the local stucture of graphs. This motivates the following central meta-question 
in graph theory: find all relations that the numbers horn {Fi,T), ... ,hoir\ (Ft, T) must satisfy in 
every graph T. Unfortunately, a satisfactory understanding of these relations has thus far been 
elusive. This failure is explained by the following simple but striking result (due to loannidis and 
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Ramakrishnan |IR95j , discovered in the context of theoretical databases) : given graphs Fi, . . . ,Ft 
and integers ai, . . . ,at, it is undecidable whether for all graphs T, the following inequality holds: 



^aihom(Fi,r) ^ 0. 



The undecidability (via a reduction to Hilbert's 10th Problem) already holds if we restrict t = 9. 
Thus, one cannot hope to fully understand the relative magnitudes of subgraph counts of even just 
9 graphs at a time! Given this unfortunate fact, we set our sights a little lower, and attempt to 
study the relative homomorphism numbers from two graphs. 

For graphs F and G such that F ^ G, the homomorphism domination exponent of F and G, 
denoted HDE(F, G), is defined as the maximal real number c such that hom(F,T) ^ hom(G, T)'^ 
for all "target" graphs T. The HDE is a parameter encoding deep aspects of the local structure of 
graphs, and we believe that it is worthy of further study. As a concrete goal, here we consider the 
question of computing HDE(F, G) given graphs F and G. 

Another motivation for the HDE comes from the theory of databases. The containment problem 
for conjunctive queries (under multiset semantics), a problem of much importance in database 
theory, is equivalent to the homomorphism domination problem in graph theory which asks, given 
graphs F and G, whether hon)(F, T) ^ hom(G, T) for all graphs T. The homomorphism domination 
exponent is a quantitative version of the homomorphism domination problem (or the conjunctive 
query containment problem); note that the homomorphism domination problem is simply the 
question whether HDE(F, G) ^ 1. 

Many classical inequalities involving graphs are naturally viewed in terms of the homomor- 
phism domination exponent. For example, the Kruskal-Katona Theorem determines the maxi- 
mum number of triangles in a graph with a given number of edges. This relationship is cap- 
tured by the equality HDE(I,<:!^) = 2/3. Similarly, a result of Kovari, Sos and Turan jKST 54]. 
which establishes a relationship between the numbers of vertices, edges and 4-cycles in a graph 
G, states that hom(C4,G) ^ (hom(I, G)/hom(» , G))'*. This is summarized by the inequality 
HDE(C'4 + I) ^ 4. In Section 1.3 we give an overview of known results from extremal 

combinatorics that imply general bounds on the homomorphisms domination exponent. 

Our principal objective in this paper is to give algorithms for computing and bounding the 
homomorphism domination exponent. We introduce new combinatorial techniques for proving 
inequalities between homomorphism numbers and establishing their tightness. 

1.1 Overview of Results 

We prove a lower bound on HDE(F, G) when F is chordal and G is any graph such that F ^ G. 
This lower bound has the form of a linear program over the convex set of G-polymatroidal functions 



(defined in Section 2.3). In the special case where F is chordal and G is series-parallel, this linear 
program computes HDE(F, G) exactly. A relaxation of this linear program turns out to be an upper 
bound on HDE(F, G) for all graphs F and G. These results are stated formally in Section [3j 



Our bounds yield several new inequalities for graph homomorphism numbers. For instance: 

5 
2' 

HDE(any directed tree of size n, the directed n-cycle (7„) = 1. 
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Let Pn denote the undirected path of size n (with n vertices and n — 1 edges). Our main theorem 
imphes: 



HDE(P„,P„) = 1 
HDE(P™,P„) = m/n 



when m ^ n, 

when m ^ n and m is odd. 



However, when m ^ n and m is even, the value of h\DE{Pm, Pn) is shghtly less than m/n (by an 
amount that depends on n mod m): 



HDE(P2,Pn) = l/rn/2l, 
'l/n 

HDE(P4,P4n+i) = { 



if i = 0, 

2/(2n + l) ifi = l, 

(4n + l)/(4n2 + 3n + 1) if i = 2, 
l/(n + l) ifi = 3. 



These expressions were discovered by solving the linear program in our main theorem for small 
values of n (which then suggested proofs for arbitrary n). The equation HDE(P4, P4„_|_2) = (4n + 
l)/(4n^ + 3n + 1) (stated as Theorem 3.4) in particular stands out as an example of an intriguing 
phenomenon associated with the HDE. Its proof (included in ^ seems like it might be hard to 
come up with by hand. We remark that finding a closed expression for h\DE(Pm, Pn) for all m and 
n is an open problem. 

By contrast, HDE(Cm,C„) for cycles Cm and C„ contains no surprises. An anonymous referee 
pointed out that Holder's inequality implies that HDE(Cm,C„) = min(m/n, 1) in all cases when 
Cm — ^ Cn (i.e., m is even or n is odd and n). 



Finally, we mention that our results (Theorem 3.1) can be used to give another proof — using 



entropy methods — of Sidorenko's conjecture |Sid91] for the special case of forests. 



1.2 The Method via an Example 

We prove our bounds using an approach based on entropy and linear programming. We now briefly 
illustrate our methods in action on a simple example. The argument is inspired by the entropy 
proof of Shearer's lemma, often attributed to Jaikumar Radhakrishnan, and its generalizations due 
to Priedgut and Kahn |FK98L lEYiOi] . 

Consider the graphs Vee and Cs pictured below. 



U2 «3 Vl 




Vee 



We will prove that HDE(Vee,C3) = 1. (This problem was posed by Erik Vee |Vee06| : a differ- 
ent solution and generalization were given by Rossman and Vee |RV06j .) As \\om{Vee,C^) = 3 
and hom(C3,C3) = 3, we have HDE(Vee,C3) ^ 1. It remains to show that for all graphs T, 
hom(Vee,T) ^ hom(C3,T). To that end, fix an arbitrary graph T such that C3 — )• T. Pick 
X uniformly at random from Hom(C3,T). For i = 1,2,3, let ai = xi^i)- Observe that the 
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joint distribution (01,02,03) is uniform on a subset of Vr x Vr x Vr of size hom((73,r). Thus 
HI(ai, 02, as) = log hom(C3, T). We now prove that ]H(ai, 02, as) ^ log hom( Vee,T). 
By the chain rule of entropy, 

H(ai, a2, as) = ]H(ai) + E[(a2|ai) + m{as\ai,a2). 

As conditioning on fewer variables can only increase entropy, we get 

]H(ai,a2,as) ^ H(ai) + H(a2|ai) + M(a3|a2). 

Now, by cyclic symmetry of ai, 02,03, we have IHI(as|a2) = IHI(a2|ai). Thus, 

(1) 11(01,02,03) ^M(ai) + 2]H(a2|ai). 

We will now interpret this expression. Consider the distribution (x, y , y') on Vr X- Vr x Vr defined 
as follows. First, x £ Vr is picked according to the distribution of 01. Next, two independent copies 
y, y' G Vt of 02 conditioned on oi = x are picked. The entropy of (x, y, y') is easily computed: 

n{x, y, y) = ]H(x) + ]H(y|x) + n{y'\x) = ]H(ai) + M(a2|oi) + ]H(o2|ai). 

Thus, we have 11(01,02,03) ^ ]HI(x,?/, y') by ([l]). 

Distribution (x, y, y') was constructed so that there is always an edge from x to y as also from 
X to y' . Thus, every point of x Vt x Vr in the support of the distribution of {x,y,y') specifies 
a unique homomorphism in Hom{Vee,T), namely the map ui x, U2 y and 1— t- y' . This 
implies that loghom(C3,r) = 11(01,02,03) ^ log hom( Vee, T), completing the proof. 

The proof of our lower bound on HDE(F, G) for chordal graphs F and arbitrary graphs G follows 
the same strategy as the argument above. When we want to prove that for all T, hom(F, T) ^ 
hom(G,r)'^, we start with a uniform distribution on Hom(G, T). We analyze its entropy and 
compare it with the entropy of several auxiliary distributions that we construct on Hom(F, T). The 
construction of the auxiliary distributions, as well as the analysis and comparisons of entropies are 
guided by a linear program. 

1.3 Related Work 

Several computational problems closely related to the computability of the homomorphism domina- 
tion exponent are known to be undecidable. Validity of linear inequalities involving homomorphism 
numbers was shown to be undecidable by |IR95] via a reduction from Hilbert's 10th problem on 
solvability of integer diophantine equations. The homomorphism domination problem with "in- 
equality constraints" is also known to be undecidable | JKV06] . 

Inequalities between homomorphism numbers have been extensively studied in extremal combi- 
natorics. For a survey, see [BCL"'"06] . Very few general results are known about the homomorphism 
domination exponent (defined here for the first time, but implicitly studied before). Alon |Alo81j 
showed that if e is an undirected edge and G is any simple graph, then HDE(e,G) = ^j^^, where 
p{G) is the fractional edge covering number of G. This result was reproved and generalized to 
hypergraphs by Friedgut and Kahn ^FK98j . Their argument used Shearer's lemma, which is closely 
related to the entropy techniques that we use. A wonderful exposition on using entropy and 
Shearer's lemma to prove classical inequalities can be found in |Fri04j . Galvin and Tetali |GT04] . 
generalizing an argument of Kahn j Kah01| . also using entropy techniques, showed that for any 
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n-regular, A^- vertex bipartite graph G, HDE(K„^„,G) = Finally, a very general approach to 
inequalities between homomorphism numbers in dense graphs was developed in |BCL+n6[ iRainT] . 
However, it is not known whether this approach can yield algorithms for deciding validity of special 
families of inequalities between homomorphism numbers. 

The entropy arguments that we use differ from the above applications in that we utilize finer 
information about conditional entropy. The key technical device that enables us to use this in- 
formation is the construction of auxiliary distributions using conditionally independent copies of 
the same random variable. This is exemplified in the example of the previous subsection by our 
definition of the distribution {x,y,y'). 

Paper Organization. Section [2] introduces the necessary definitions and tools related to graphs 
and homomorphisms. Our results are formally stated in Section [3| Definitions and auxiliary 
lemmas on Markov random fields are given in Section [4j Proofs of our main theorems are presented 
in Sections [5| [6j [7] and [8| We state our conclusions in Section [9| 

2 Preliminaries 

We first fix some basic notation. For a natural number n, let [n] denote the set {1, . . . ,n}. The 
powerset of a set X is denoted by p{X). If 5 is a family of sets, let f]S denote the intersection 
Hse^ "S*- We adopt the convention that = 0. 

2.1 Graphs and Homomorphisms 

Graphs will be finite and directed. Formally, a graph is a pair G = {Vg, Eg) where Vg is a nonempty 
finite set and Eg is a subset of Vg x Vg- For a subset j4 C Vg, we denote by G|a the induced 
subgraph of G with vertex set A. We denote by k-G the disjoint union of k copies of G. The 
(categorical) product F x G oi graphs F and G has vertex set VpxG = Vp x Vg and edge set 
EpxG = {{{a,v), (b,w)) : (a, 6) G Ep and {v,w) G Eg}. 

A graph G is simple if the relation Eg is antirefiexive and symmetric, i.e., if {vjw) G Eg then 
V ^ w and {w, v) G Eg- Every graph G is associated with a simple graph G defined by V^^ = Vg 
and Eq = {{v.,w) : v ^ w and (u, G Eg or {w,v) G Eg}- Whenever we speak of cliques, 
connectivity, etc., of G, we mean cliques, connectivity, etc., of the associated simple graph G. In 
particular, a clique in a graph G is a set of vertices A (^Vg such that (f , w) G Eg or (tt;, v) G Eg 
for all distinct v,w ^ A. We denote by Cliques(G) the set of cliques in G and by MaxCliques(G) 
the set of maximal cliques in G. The number of connected components of G is denoted by CC(G). 

A homomorphism from a graph F to a graph G is a function ip : Vp — > Vg such that 
{ip{a) , ip{h)) G Eg for all (a, 6) G Ep. Let Hom(F, G) denote the set of homomorphisms from 
F to G and let hom(F,G) = |Hom(F,G)|. Notation F ^ G expresses hom(F,G) ^ 1. Under 
disjoint unions (+) and categorical graph product (x), hom(_, _) obeys identities 

hom(Fi + F2, G) = hom(Fi, G) • hom(F2, G), 
hom(F, Gi X G2) = hom(F, Gi) • hom(F, G2). 

A graph F is chordal if the simple graph F contains no induced cycle of size ^ 4. Chordal 
graphs are alternatively characterized by the existence of an elimination ordering. A vertex v is 
eliminable in a graph F if the neighborhood of w is a clique in F. An enumeration vi, . . . ,Vn of Vp 
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is an elimination ordering for F if Vj is eliminable in } for all j G [n]. By a well-known 

characterization, a graph -F is chordal if and only if it has an elimination ordering. 

A 2-tree is a chordal graph with clique number at most 3 (i.e., containing no K4). A graph G 
is series-parallel if G is a subgraph of some 2-tree. 

2.2 The Homomorphism Domination Exponent 

We now formally define the homomorphism domination exponent. 

Definition 2.1 (Homomorphism Domination Exponent). For graphs F and G such that F — )• Cj^ 
the homomorphism domination exponent HDE(i^, G) is defined by 

HDE(F,G) = sup{c G R : hom(F,T) ^ hom(G,r)^ for all graphs T}. 
We write F )^ G and say F homomorphism- dominates G if HDE(i<', G) ^ 1. 

The following dual expression for HDE(F, G) is often useful: 
(2) HDE(F,G)= inf ^o^^o^iF^T) 

^ ' ^ ' T:hom(G,T)>2loghom(G,r) 

We remark that this inf is not always a min. 

The following lemma (proof omitted) lists some basic properties of the homomorphism domi- 
nation exponent. 

Lemma 2.2 (Basic Properties of HDE). 

(a) Ifc = HDE(F,G), then hom(F,r) ^ hom(G,r)^ for all graphs T. (That is, we can replace 



sup hy max in Definition 2.1.) 

(b) The homomorphism- domination relation )p is a partial order on graphs. 

(c) HDE(F,i/) ^ HDE(F,G) • HDE(G,//). 

(d) HDE(m-F, n-G) = ^ • HDE(F, G) for all positive integers m,n. 

(e) // there exists a surjective homomorphism from F onto G, then F )^ G. 

(f) HDE(F, G) > z/ and only if U^eHom(F,G) Range(v9) = Vg- 

By ([2]), every graph T with hom(G, T) ^ 2 provides an upper bound on HDE(F, G). By taking 
specific graphs Ti, T2 and {T^,n)n^i in the figure below, we get the following general upper bounds 
onHDE(F,G). 



o o o o ^ 

Ti T2 T3,„ 

Taking T = Ti, we get the upper bound HDE(F,G) ^ |Vf|/|Vg|. Taking T = we have that 
HDE(F, G) ^ CC(F)/CC(G). A slightly more complicated upper bound follows by taking T = T^^n 
and letting n — )• 00; the result is that HDE(F, G) is at most the ratio a{F) / a{G) of the independence 
numbers of F and G, since hom(H,T^ n) grows like G(n"(-f^)) for every graph H. 



^We do not define HDE(_F, G) whenever F -fr G. However, it might be a reasonable convention to let HDE(F, G) 
-00. 
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2.3 G-Polymatroidal Functions 

Definition 2.3. For a graph G, let V{G) and Q{G) be the fohowing sets of functions from piVc) 
to [0,1]. 



A function p : p{Vg) - 

(0 at 0) 
(monotone) 
(submodular) 
(G-independent) 



■ R is G-polymatroidal if it satisfies the fohowing four conditions: 
p(0) = 0, 

p{A) ^ p{B) for Sil\A<ZBQ Vg, 

p{A nB)+ p{A U ^ p{A) + p{B) for ah A,B <Z Vg, 
p{AnB) +p{A\JB) = p{A) +p{B) for ah A,B OVg 
such that AdB separates A\B and B\A in G (i.e., 
there is no edge in G between A\B and B \A). 

A G-polymatroidal function p is normalized if in addition it satisfies: 
(normalized) p(Vg) = 1- 

• V{G) denotes the set of normalized G-polymatroidal functions. 

• Q{G) denotes the set of functions q : p{Vg) — > H which satisfy: 

g(0) = 0, qiA) ^ for ah AQVg, ^ qiA) ■ CC(GU) = 1. 

ACVg 

Example 2.4. Let a,b,c be the vertices of K3. Then 'P{K^) is the set of convex combinations of 
eight functions from p({a,6, c}) to [0,1], which we label as fa,fb,fab,fac,fbc,fabc (corresponding 
to the seven nonempty subsets of {a, b, c}) and /rs ("RS" stands for Ruzsa-Szemeredi, for reasons 
that will be explained later on), given by the following table: 








{a} 


{b} 




{a, 6} 


{a,c} 


{b,c} 


{a,b, c} 


fa 





1 








1 


1 





1 


fb 








1 





1 





1 


1 


fc 











1 





1 


1 


1 


fab 





1 


1 





1 


1 


1 


1 


fac 





1 





1 


1 


1 


1 


1 


fbc 








1 


1 


1 


1 


1 


1 


fabc 





1 


1 


1 


1 


1 


1 


1 


/rs 





1/2 


1/2 


1/2 


1 


1 


1 


1 



We will use the following identity for G-polymatroidal functions when G is chordal. 

Lemma 2.5 (Identity for Chordal-Polymatroidal Functions). If G is chordal, then for every G- 
polymatroidal function p : p{Vg) — > IR- and every elimination ordering vi, . . . ,Vn for G, 

v{Vg) = E -(-i)i^ip(n^) 



5CMaxCliques(G) 



({neighbors of Vi among vi, . . . , Vj-i} U {wj}) — ^({neighbors of Vi among 



■.Vi-l 



})• 



i=l 



Lemma 2.5 is established by a straightforward inductive argument (proof omitted). 
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3 Results 



Our first theorem gives a lower bound on HDE(F, G) when F is chordal. 
Theorem 3.1. If F is chordal and G is any graph, then 



HDE(F,G)^ min max 

p€:V{G) (peHom(>,G) 



E 



■(-1)1*1 -/iMns)). 



SCMaxCliques(F) 



Theorem 3.1 is proved by a generahzation of the entropy technique iUustrated by the example 



in n.2 



Our second theorem gives an upper bound on HDE(i^, G) for general graphs F and G. 
Theorem 3.2. For all graphs F and G, 

HDE(F,G)^ min max V • CC(FL-1M^) 



The next theorem establishes that Theorem 3.1 is tight in the special case where G is series- 
parallel. 



Theorem 3.3. // F is chordal and G is series-parallel, then 

^ -(-1)1^1. p(^(n5)). 

5CMaxCliques(F) 

The final theorem (mentioned in the introduction) is an example of an interesting HDE com- 



HDE(F,G)= min max 

p&V{G) (peHom(F,G) 



putation discovered with the help of the linear program of Theorem 3.3 

4n + 1 



Theorem 3.4. HDE(P4, P4„+2) 



4n2 + 3n + 1 



Theorems 3.1 3.2 3.3 and 3.4 are respectively proved in Sections pi pi and ^ 



Discussion 1. Tightness of our lower and upper bounds 



The HDE upper bound of Theorem 3.2 is not tight for all pairs of graphs. For instance, F = C4 + 
2-Ki (an undirected 4-cycle plus two isolated vertices) and G = K2, it holds that HDE(F, G) = 8/3, 



while Theorem 3.2 only implies HDE(i<', G) ^ 3. However, we can show that Theorem 3.2 is tight 



when (the underlying simple graphs of) F and G are forests. 

We do not have any example of a chordal graph F and a graph G for which the HDE lower 
bound of Theorem |3.1| is not tight. However, there are reasons to believe that the tightness of 



this lower bound is not the question. Recall that the linear program in Theorem 3.1 has domain 
V{G), the set of normalized G-polymatroidal functions. In fact (as will obvious from the proof of 



Theorem 3.1), we can replace V{G) with the subset {hx ■ X G MRF(G)} of normalized entropic 



functions of Markov random fields over G (defined in the next section). Let £{G) denote the closure 
of {hx '■ X G MRF(G)} in IR,^"^. The set £{G), whose members are called G-entropic functions, 
is a convex subset of V{G) and a well-studied object in information theory. When \Vg\ ^ 3, we 
have £{G) = V{G). However, these sets do not coincide in general. For instance, £{Ki) is a proper 
subset of V{Ki) (due to the existence of "non-Shannon information inequalities" on 4 random 
variables); in fact, Si^JC^) fails even to be a polytope. While it seems unnatural to conjecture that 



the HDE lower bound of Theorem 3.1 is tight as stated, the same conjecture for the corresponding 
linear program over £{G) would appear more reasonable. 
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Discussion 2. Theorem 3.2 is a linear program relaxation of Theorem 3.1| 



It is worth pointing out that the hnear program in the HDE upper bound of Theorem 3.2 is (after 
a hnear cha nge of variables) a direct relaxation of the linear program in the HDE lower bound 
of Theorem 3.1 To see this, consider the invertible linear transformation L : IR^^^'^^ — > R^^^^g) 
which takes a function / : p{Vg) — > K, to a function Lf : p{Vg) — > H defined by 

{Lf){A) = 

B : A\JB=Vg 

We need a combinatorial lemma on chordal graphs. 
Lemma 3.5. Suppose F is chordal. 
(a) For all ACVp, 



E (-1)'"= E (-l)l^"^ICC(F|^). 

S'CMaxCliques(F) B : AUB=Vf 

(b) For every function f : p(Vf) — > II, 

E -i-if^fins) = E • cc(fu). 

5CMaxCliques(F) ACVp 

(c) For every homomorphism cp : F — > G and function g : piVc) — > 

E -{-lf\g{^{nS)) = E (^9){A) ■ CC(F|^-i(^)). 

5CMaxCliques(F) ACVq 

Lemma [3. 5| can be proved by an inductive argument, or alternatively, using elementary algebraic 
topology (Euler characteristics of flag complexes associated with chordal graphs). Statement (a) is 
the essential identity; statement (b) follows directed from (a); statement (c), which is the result we 
need, is a slight extension of (b). 

As an immediate corollary of Lemma [3.5|^c), we get: 



Corollary 3.6 (Alternative Statement of Theorem |3.1[ ). If F is chordal and G is any graph, then 
HDE(F,G)^ min max V g(^) • CC(FL-im)). 

To see that the linear program of Theorem |3.2| is a direct relaxation of the linear program of 



Theorem 3.1, it suffices to show that Q{G) Q L{V{G)) for all graphs G, which can be checked by 
applying to an arbitrary function in Q and seeing that the resulting function is normalized G- 
polymatroidal. Indeed, for any q G Q(G), the function L~^q is given by {L~^q){A) = YIbciVg ^(^) ' 
CC(G|(p-i(yli-i5)), which one can show is normalized G-polymatroidal. 
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4 Chordal Pullbacks of Markov Random Fields 



A (probability) distribution over a nonempty finite set is a function X : 0, — > [0, 1] such 
that "^^^Q X{uj) = 1. We denote by D\5t{X) the set of all distributions over Q. The support 
of X is the set Supp(X) = {u e ft : X{uj) > 0}. The entropy of X is defined by m{X) = 
X^weri — X{lj) log X{uj). Since the uniform distribution maximizes entropy among all distributions 
with a given support, it holds that ]H(X) ^ log |Supp(X)|. 

For a finite set I, we refer to distributions X S Dist(r2'^) as called I-indexed joint distribution 
(with values in 0,). We view the coordinates Xi {i G /) as random variables taking values in fi. We 
speak of independence and conditional independence among random variables X^. For all J I, 
we denote by Xj the marginal J -indexed joint distribution {Xj : j £ J) viewed as a distribution in 
Dist(Q^). 

For an /-indexed joint distribution X, we denote by hx ■ p{I) — > [0, 1] the normalized entropy 
function of X defined by hx{J) = M(Xj)/M{X). By Shannon's classical information inequalities 
(see [Yeu06j ) . the function hx is monotone and submodular. 

For a graph G, a Vc-indexed joint distribution X G Dist(r2^'^) is a Markov random field over G 
if H(Xa) + H(Xb) = ]H(Xaub) + H(XAnB) for aU A,B (^Vq such that Af^B separates A\B and 
B \ Am. G. By Shannon's information inequalities, for X € MRF(G), the function A i — > M.{Xa) 



is G-polymatroidal (recall Definition 2.3). Hence, assuming ]H(X) > 0, the normalized entropy 



function hx belongs to V{G). By Lemma |2.5[ it follows that 



(3) H(X)= -(-l)l^llH(Xn5). 

SCMaxCliques(G) 

We denote by MRF(G, il) the set of all Markov random fields over G with values in fJ. We write 
MRF(G) for the class of all Markov random fields over G. Note that MRF(G) depends only on the 
underlying simple graph of G. If Gi and G2 are simple graphs such that = and Eq^ 5 Eq^, 
then MRF(Gi) C MRF(G2), i.e., every Markov random field over Gi is a Markov random field over 
G2. 

Example 4.1. For all graphs G and T such that G ^ T, the uniform distribution on Hom(G, T), 
viewed as an element of Dist((VT)^'^), is a Markov random field over G with entropy log hom(G, T). 

The next lemma gives a mechanism for constructing one Markov random field from another. 

Lemma 4.2 (Fullback of a MRF). Let ip be a homomorphism from a chordal graph F to a graph 
G. Then for every X G MRF(G,r2) there exists a unique X G MRF(F, fi) (called the puUback of 
X along (p) such that for every clique C G Cliques(F), the marginal distributions {Xc : c G C) 
and (-'^^(c) ■ c £ G) are identical. Moreover, if 0, = Vt where T is a graph such that Supp(X) C 
Hom(G,r), then Supp(X) C Hom(F,r). 

We already saw pullbacks of Markov random fields in action when we computed HDE( Vee, C3) 
in C2I 



Proof Sketch. We can construct X according to the following procedure. Fix an arbitrary elimina- 
tion ordering vi, . . . ,Vn of F {so that Vj is an eliminable vertex of for all j G [n]). We now 
pick values for Xy-^, . . . ,Xy^ (i.e., the coordinates of joint distribution X = (X„)„gi? G Dist(0^^)) 
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in order. Assuming values X^^, . . . , X^j^. j have been picked, we next pick Xy. according to the 
distribution X^p(^^.-^ conditioned on ^^(t,,) = Xy^ for z = 1, . . . , j — 1. 

One can show that the resulting distribution X is a Markov random field over F. Indeed, 
it is the unique Markov random field meeting the conditions of the lemma; in particular X is 
independent of the particular elimination ordering wi, . . . , of F. In the event that Q, = Vt where 
T is a graph such that Supp(X) C Hom(G, T), it is easy to show that every point of (Vr)^^ in the 
support of X is a homomorphism in Hom(F, T). □ 



5 Proof of Theorem 3.1 (HDE Lower Bound for Chordal F) 

Suppose F is chordal and Hom(F, G) is nonempty. Let T be a graph such that hom(G, T) ^ 2. Let 
X e Dist((VT)^'^) be the uniform distribution on Hom(G,r) (so X £ MRF(G), see Example [4T|. 
Let hx ■ p{Vg) — > [0, 1] be the normalized entropy function of X and note that hx G T-'iG) and 

/ix(^) =IHI(XA)/loghom(G,r). 

For each homomorphism ip G Hom(F, G), let G MRF(F, Vt) be the pullback of X along 93, 
as described in Lemma [42} We have Supp(y^) C Hom(F, T) and hence M.{Y'f) ^ loghom(F, T). 
By equation ([s]) we have the following identity (independent of the graph T): 

H(y^) = -{-^rHx^ins)) = E -{-^f\hx{^{mmx). 

5CMaxCliques(F) 5CMaxCliques(F) 

It follows that 

loghom(F,r)^ max V -{-l)\^\hx{^{{^S))\og\\om{G,T). 

^ ' S'CMaxCliques(F) 

Since this inequality holds for all graphs T such that hom(G, T) ^ 2, we have 

HDE(F,G)= inf (by ^2 

^ ' ' T:hom(G,T);^2 log horn (G,r) ^ ^ ^ 

^ inf max V -{-l)\S\hj,{^{i^S)). 

T:hom(G,r);^2veHom(F,G) ^ 

ACMaxCliques(F) 

Since hx S 'P(G) for all T, we get the desired result that 

HDE(F,G)^ min max V -(-l)l"^lp(w(n5)). 

pePCG) ^eHom(F,G) ^ , 

' ^ ^ ^ 5CMaxCliques{F) 



6 Proof of Theorem 3.2 (HDE Upper Bound) 



Fix a graph G and a function q £ Q(G). That is, let g be a function from p{Vg) to [0, 1] such that 
g(0) = and EacVo '?(^) " CC(GU) = L 

We define a sequence (T„)n^i of "target" graphs as follows. Vertices of T„ are all pairs {x,i) 
where x G Vq and i E f^i'^'^^a-^^A} g, function from {A C Vg : a; G A\ to IN which satisfies 
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i{A) < n'^^'^K There is an edge in T„ from vertex {x,i) to vertex {y,j) if and only if {x,y) G Eg 
and i{A) = j{A) for all {x, y} C A C Vg- 

Let TTn denote the homomorphism from Tn to G defined by vr„((x, i)) = x. Let -F be a graph and 
suppose 99 is a homomorphism from F to G. We denote by Hom,^(F, Tn) the set of homomorphisms 
tp : F ^ Tn such that TTn o ip = ip, i-e-, the following diagram commutes: 



Tn 



F^-^G 

Let \\om^p{^F^Tn) — I Hom(p(i^, r„) I and note that 

(4) hom(F,r„)= hom^(F,T„). 

(^eHom(F,G) 

Lemma 6.1. lim log„ hom(p(F, r„) = \^ q{A) ■ CC(F| 

ACVg 

Proof. Let ip G Hom^(F, T„). Each vertex u £ Vp is mapped under ip to a pair {ip{u),iu) for some 
i„ G ]N{^cyG:'^(«)eA} g^^ject to < n'?^^). The family of functions (i„)„eVF is further subject 

to the constraint that iu{A) = iv{A) for all u, f G Vf and {ip{u) , ip{v)} C ^4 C Vg such that and 
V lie in the same connected component of Flj^-i^^^. To see this, consider an undirected path in 

from u to i.e., a sequence u = wo,wi,W2, . . . ,Wk = v such that {w£^i,W£) or 
is an edge in F|(^-i(^) for every £ G {1, . . . , k}. Suppose {(p{u), (p{v)} ^ yl C Vg and u, v lie in the 
same connected component of F|^-i(^). Then clearly {ip{we-i), ip{wi)} C A for all £ G {1, ... , k}. 
Since (wi-ijWi) or {wi,W£-i) is an edge in F and "0 is a homomorphism from F to T„, we have 
that {il;{we-i),ip{wi)) or ('^(t/;^), is an edge in T„. It follows that i<^(^^_j)(B) = i^(^^)(_B) 
for all C 5 C V^. In particular, we have i<^(^^_-^)(^) = i^{we){A)- Therefore 

iuiA) = i^Q^A) = • • • = i^^(A) = 

Conversely, every family of functions {ju G fii^'='^G-vi'u-)eA} . ^ g subject to jij(A) < n''^^^ 
and jM(^) = jv{A) for all u, u G Vp and {f{u), ^p{v)} C ^ C Vg such that u and u lie in the same 
connected component of F|^-i(^), determines a distinct homomorphism in Hom^(F, r„). Thus, 

hom^(F, Tn) equals the number of such families (jn)MeV>- This is precisely HyicvG 1"'^''*'^^ '-*-(^l¥J-i{A))"j 
since for each j4 C Vg and each connected component U of F|^-i(^), we have an independent choice 
of numbers rriA^ij G {0, . . . , [n'^^'^^] — 1} such that ju{A) = ttia^jj for all u €z U. Taking logarithms 
in base n, we get the statement of the lemma. □ 

Corollary 6.2. lim log„ hom(F, T„) = max V gM) • CC(FL-1M^). 

"^00 </3eHom(F,G) ^ ^ ' 

^ ACVg 



This corollary follows immediately from Q and Lemma 6.1 We are ready to prove Theorem 3.2 



Proof of Theorem \3.S\ Suppose F ^ G. For q G Q(G), let (T„)„^i be the sequence of "target" 



graphs as above. By Corollary 6.2 (applied to G), we have 



lim log„hom(G,r„) = max V q(A) ■ CC(G|(/?-^M)) ^ V qiA) ■ CCfGU) = 1 
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where the middle inequahty is obtained by taking Lp to be the identity homomorphism on G. 
We now have 



HDE(F,G)f hm |°gn ^^°^(^' ^ hm log„ hom(F, r„) = max V g(^) • CC(FL-imO 

^ ^ ^ n^oo log„ horn G, r„ n^oo ^" V , n; ^^^^^^^^^^ ) V Icp {A)j 



where the last equality is by Corollary 6.2 Since this inequality holds for all q G Q(G'), it follows 
that 

HDE(F,G)^ min max V gM) • CC(FL-1M^). □ 



7 Proof of Theorem |3.3| (HDE of Chordal F and Series-Parallel 
G) 



Suppose F is chordal and G is series-parallel and F ^ G. The HDE lower bound of Theorem 3.1 
states 

HDE(F,G)^ min max V -(-1)1^1 • p((/j(nS)). 

6CMaxCliques(F) 



Let p be an arbitrary function in V{G). To prove Theorem 3.3 (i.e., to prove this inequality is 
tight), we construct a sequence of graphs r„ satisfying 

(5) lim log„hom(G,r„) ^ 1, 

n— >oo 

(6) hm log„hom(F,rO^ max ^^ ^ -i-lf\pi^{f]S)). 

^ ' 5CMaxCliques(F) 

Tightness of the above HDE lower bound then follows from ^ . 

To simplify matters, we first consider the special case that G is chordal. (Since G is chordal 



and series-parallel, it has clique number ^ 3, i.e., G is a 2-tree.) After proving Theorem 3.3 in this 
special case, we give the argument for general series-parallel G in Section [7. 4[ 

We construct T = r„ in two stages. For every A G MaxCliques(G), we construct a graph Ta 
together with a homomorphism : Ta — > Ka (the complete graph on A, viewed as a subgraph 
of G). We then patch together (via a randomized gluing procedure) the various graphs Ta into a 
graph T together with a homomorphism vr : T — > G. (This indexing over maximal cliques in the 
chordal graph G is essential to defining the gluing procedure in a consistent fashion.) 

For a,b,c G Vg, we write p{a) , p{ab) , p{abc) for p{{a}),p{{a,b}),p{{a,b,c}) respectively. For 
Ac. Vg, we treat n^(^) as integers (by rounding), mindful to preserve identities such as nP(")+p(*'^) = 
jivio) jiP{bc) _ ggcause we are ultimately interested in asymptotics in log base n, this kind of rounding 
presents no difficulties. 



7.1 Construction of Ta 

Consider any A G MaxCliques(G) and note that \A\ G {1,2,3}. 

If 1^1 = 1 (say A = {a}), then Ta is the empty (edgeless) graph on n^(") vertices and -ka maps 
all vertices of Ta to a. 
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Now suppose 1^1 = 2 (say A = {a, b}). Letting 

(7) a = nP(''), /3 = nP(*), j = nP^^^+P^^'^-P^"^^ 

(note that 7 ^ 1 by submodularity of p), Ta is the graph ^-Ka^p (i.e., 7 disjoint copies of the 
complete bipartite graph Ka^p) and vr^ G Hom(TA, Ka) maps the two parts of each Ka^p to vertices 
a and 6 of iiT/i (i.e., the a-size part to a and the /3-size part to b). 

We now examine the nontrivial case when |^| = 3 (say A = {a, 6, c}). Consider the restriction 
of p to p(^). So long as p(A) > 0, the normalized function \ p{A) is i^^-polymatroidal 

(if p{A) = 0, then p \ p{A) is identically zero). By Example 2.4, it follows that p \ p{A) is a 
nonnegative hnear combination of functions fa,fb,fc, fab, fac, fbc, fabc and /rs- That is, 

p \ p{A) = Xifi for some Ai ^ 0. 

iS:{a,b,c,ab,ac,bc,abc,IlS} 

(We will harmlessly treat n'^' as integers.) Note the identities: 

p{a) = Xa + Xab + Xac + Xabc + ^Ars, 

(8) p{ab) = Xa + Xh + Xab + Xac + Xhc + Xabc + ^Ars, 
p{abc) = Xa + Xb + Xc + Xab + Aac + he + Kbc + |Ars. 

For each i G {a,b,c,ab,ac,bc,abc,IiS}, we will construct a graph TA,i and a homomorphism 
7rA,i ■ TA^i — > Ka- Once we have defined these, we obtain Ta as the fibered product of graphs TA,i- 

• the vertices of Ta are the elements (vi) G Hi such that '7TA,i{vi) = TrAj{vj) for all i,j E 
{a, b, c, a6, ac, 6c, abc, RS}, and 

• there is an edge between vertices (vi) and (wi) of if and only if there is an edge between 
Vi and Wi in TA^i for every i £ {a, 6, c, a6, ac, 6c, a6c, RS}. 

The homomorphism tta : Ta — > Ka is defined in the obvious way: 

• TTAiivi)) equals the common value of vr^,j(vj). 

We now define TA,i and 7rA,i for the various i S {a, 6, c, a6, ac, 6c, a6c, RS}. In all cases, after 
defining TA^i, the homomorphism TTA,i will be obvious. Also, the definitions of TA,b and Ta^c will 
be obvious after stating the definition of Tam, so we include only the cases i G {a, a6, a6c, RS}. 

• TA,a has vertex set ({a} x [n'*'°]) U {6, c} and edges {6, c} and {(a, i),6} and {(a, i),c} for all 

• TA,ab has vertex set ({a, 6} x [n'^"'']) U {c} and edges {(a, i), (6, i)} and {(a, «),c} and {(6, i),c} 
for all i G [n'^"'']. 

• TA,abc has vertex set {a, 6, c} x [n'^°'"=] and edges {(a, i), (6, «)} and {(a, i), (c, i)} and {(6, i), (c, «)} 
for all i G [n'^'''"^]. 

• If Ars = 0, then Ta,rs = Ka and tta is the identity function on A. 

To define the remaining graph T^,rs when Ars > 0, we use a result of Ruzsa and Szemeredi |RS78j . 
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Theorem 7.1 (Ruzsa-Szemeredi |RS78j ). For all m G IN, there exists a tripartite graph H{m) in 
which: 



(i) each part has size m, 

(ii) there are m'^~°^^^ triangles, and 

(iii) every edge is contained in exactly one triangle. 

(This is not the usual statement of the Ruzsa-Szemeredi result. However, it is easily seen to be 
equivalent to the usual statement that there exists a bipartite graph with parts of size m whose 
edge set is the disjoint union of m^~°^^^ induced matchings of size at least m^~°^^\) 



Using Theorem 7.1 we define Ta rs in the remaining case: 



• If Ars > 0, let Ta,rs be the graph H{n^^^s) of Theorem [tj] and let tta.rs S Hom(ryi,Rs, Ka) 
be any function mapping the three parts to a, 6 and c. 

Recalling the definition of Ta (as a fibered product of graphs Ta^i), it is easy to check using 
equations ([s]) that the graph Ta satisfies: 

I {vertices of Ta which map to o under tta}] = n^^°'\ 
I {edges of Ta which map to {a, b} under vr^}! = nP('''')-°(i), 

I {triangles in Ta}\ = nP^''^^)-"^!). 

Moreover, the o(l) terms disappear whenever Ars = 0. 
7.2 Gluing Procedure 

We now describe the randomized procedure for gluing together the various graphs Ta and homo- 
morphisms tta ■ Ta — > Ka into a single graph T and homomorphism vr : Ta — > G. It is enough 
to describe the procedure for gluing a pair of graphs Ta and Tg for A, B £ MaxCliques(G): there 
is an obvious way of simultaneously and consistently carrying out all pairwise gluings to obtain T 
and vr (relying on the chordality of G). 

Let A,B£ l\/laxCliques(G). There are three gluing procedures to consider, depending on |^n 
B\ £ {0, 1, 2}. In the simplest case that ACi B = ^, the gluing of Ta and Tb is just the disjoint 
union Ta W Tb and gluing of homomorphisms tta and vr^ is obvious. 

Next suppose that \AnB\ = l (say AnB = {a}). Note that |7r;^^(a)| = |vr;^^(a)| = nP'^'^\ The 
gluing of Ta and Tb is defined by starting with the disjoint union Ta tf Tb and identifying pairs of 
vertices in ■7r'^^{a) x 7r^^(a) under a uniformly choosen random bijection between sets 7r^^(a) and 
7r^^(a). 

Finally, suppose that |^ H i?| = 2 (say A D B = {a,b}). In this case, it must happen that 
1^1 = \B\ = 3. Define a,/3,7 again by equation ([T]) and consider the graph "f-Ka^p- We claim that 
bipartite graphs TaI^-i^^^^^ and TbI^-^i^^^ ^j.^ both look like 'j-Ka^p after deleting an n~°(^''-fraction 



of edges from the latter. (The proof of Claim 7.2, below, follows easily from definitions. 



Claim 7.2. There exist homomorphisms ^a '■ ^^U-ida b}) — ^ 7"-^a,/3 and ^b '■ '^A\.,^-ii^^ab}) 
^■Ka^p such that 

• E,A o,nd S^B are bijections (between vertex sets), and 
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• rnaps vr^ (a) to the aside of^-K^p and vr^ (6) to the jS-side of'^-K^p, and similarly for 

Moreover, O'fid ^sl^-i^j^jj^ both have at least n^+P+'y~"W edges (thus, these graphs 

may he obtained from 7-i^o,/3 by deleting an n~°^'^^ -fraction of edges). 

After fixing arbitrary and ^s, tfie gluing procedure works as follows. We pick a uniform 
random automorphism ^ of (i.e., an element of the group {Sa x S^) k S^). The function 

o ^ o is a bijection of sets vr^ ({0,6}) and vr^ {{a,b}). Starting from the disjoint union 
of Ta and Tg, we identify pairs of vertices under this bijection. Finally, we keep edges between 
pairs of identified vertices if and only if edges existed between these vertices in both Ta and Tg. 
(Intuitively, we randomly overlap Ta and Tb within the confines of ^-Ka^p and keep only the edges 
which occur in both Ta and Tb-) 

Having defined randomized gluings for pairs of graphs Ta and Tb, suffice it to say that these 
pairwise gluings can without difficulty be carried out simultaneously and consistently over all A S 
MaxCliques(G) to obtain the graph T and homomorphism vr : T — > G (chordality of G is crucial 
here) . 



7.3 Counting Homomorphisms from F and G 

Now that we have defined the sequence of graphs T„ and homomorphisms 7r„ : T„ — > G, it remains 
to prove inequalities ([s]) and Both inequalities follow from the following claim. 

Claim 7.3. If H is a chordal graph and 99 G Hom(i/, G), then 

log„ \{e G Hom(F, r„) : ^„ o ^ = (^}| = _ ^(i). 

5CMaxCliques(H) 

Before proving Claim [7^ let's see how it implies inequalities ([s]) and Q. To prove ([s]), we take 



H = G and ip = idy^ (the identity map on Vg viewed as a homomorphism G — > G) in Claim 7.3 
and see that 

log„, hom(G,r„) ^ log„, \{e G Hom(G,r„) : ^„ o = idy^}| 

J] -(-l)l"^lp(n5)-o(l) = 1-0(1) (by Lemma [2:5]). 

SCMaxCliques(G) 

Inequality ^ is immediate from Claim 7.3 taking H = F: 

lim log„ hom(F, r„) = lim max log„ \{6 G Hom(F, T„) : iTnoO = ip}\ (as hom(F, r„) ""~^'^> 00) 

n-s>oo n-s>oo ,^gHom{F,G) 

^ -(-i)i^ip((^(ns)). 

SCMaxCliques(/i') 

Now for the proof of this claim: 



Proof of Claim 7.3. We define a supergraph T* of T as follows. For each A G MaxCliques(G), we 
define a supergraph T^ of Ta and apply the same gluing procedure. If \A\ ^ 2, let = Ta. 
If l^l = 3 (say A = {a,b,c}), recall that Ta is the fibred product of graphs T^,a, • • • ■,TA,abc and 
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2a,rs; let be the fibred product of graphs r^,a, ■ ■ ■ , T^^abc and T\ j^g where p^g is the complete 

tripartite graph with all parts of size n^^'^^^^\ Viewing T^,rs as a subgraph of T^rs (with the 
same vertex set) and apply the same gluing procedure (i.e., with the same randomization), we view 
T as a subgraph of T* (with the same vertex set). It now suffices to prove the following: 

(9) logJ{0G Hom(F,r:) :7r„o0 = (^}| = 

-{-^f\p{ip{r\s)) 

5CMaxCliques(H) 

+ Yl |ARs(A)-|{A'GMaxCliques(i7):vp(^')=^}|, 

AeMaxCliques{G) : |A|=3 

(10) log„ Pr,eHom(J/,T-)[^ G Hom(F,r„)] = 

Y iARs(A)-|{A'GMaxCliques(i7):(^(^')=^}|-o(l). 

AeMaxCliques{G) : |A|=3 

We first give the argument for equation ([9]). Note the following: 

• for every edge (a, h) in G and every a' G 7r~^(a), 

\{h' G 7r-i(6) : (a', 6') is an edge in T*}] = n^i''^'^-P^''\ 

• for every triangle {a,b,c) in G and every a' G 7r^^(a) and b' G vr~-^(6) such that {a',b') is an 
edge in T*, 

|{c' G 7r-^(c) : (a',6',c') is a triangle in T*}\ = nP(«M-pH)+^ARs(aM. 

It follows that if f 1 , . . . , is an elimination ordering for H then 
logJ{^G Hom(i/,r„*) :7r„o^ = (^}| = 

n 

p(i/9({neighbors of Vi among vi,. . . ,Vi-i} U {««})) — p(9j({neighbors of Vi among vi,. . . ,Vi-i})) 

1=1 

+ Yl ^ARs(A)-|{A'GMaxCliques(//) :v?(^') = ^}|• 

AeMaxCliques(G) : |yl|=3 



Equation idh now follows using Lemma 2.5 



For equation (10), notice that a triangle {a',b',c') over {a,b,c) in T* is a triangle in T„ with 
probability n~^i's(«^)-o(i). Now consider a uniform random homomorphism G h\om{H,T*). 
For an edge {x,y) in consider the vertices zi,...,Zm such that {x,y,Zj) are triangles in i/. 
The key observation (using chordality of H) is that events {{6{x),6{y),9{zj)) is a triangle in 
T„}j=i^...^m are independent conditioned on 6{x) and 0{y). By expanding the probability that 



9 G Hom(ff, T„) conditionally along an elimination ordering, we see t 

probability Htriangies {x,y,z) in H ^ which proves (10) and completes the proof 

of Claim 



hat 9 ^ Hom(F,r„) with 

of 

□ 
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7.4 Series-Parallel G 



Finally, we prove the theorem for the case when G is series-parallel (but not necessarily chordal). 
Recall that for every series-parallel graph G, there exists a 2-tree G (i.e., a i^'4-free chordal graph) 
such that Vg = and Eq ^ E^. Fix any such G. 

Consider any p S V{G). Note that V{G) C V{G) (i.e., any normalized G-polymatroidal function 
is also normalized G-polymatroidal) . Ther efore, we can construct graphs Tn with homomorphisms 



TTn 



G such that (by Claim 7.3 applied to G and T„) for every chordal graph H and 



(/? G Hom(/f,G), 

(11) log„ |{0 G Hom(i/, T,) : vr„ o = ^}| = _(_i)|5|p(^(n5)) _ o(l). 

S'CMaxCliques(H) 

Let T„ be the subgraph of T„ which has the same vertices, but where we keep an edge {v,w) from 
Tn if and only if (7r„(z;), 7r„(t(;)) is an edge of G. Note that 7r„ is a homomorphism in Hom(T„,G). 
By (11), Claim [73] now holds (exactly as stated) for G and r„. The proof of inequalities ^ and 
Q then follows by the exact same argument. 



8 Proof of Theorem 3.4 (HDE of P4 and P4n+2) 



In this section we give the proof of Theorem 3.4 (the equation HDE(P4, ^4^4.2) = (4n -|- l)/(4n -|- 



3n -|- 1)), which was discovered by solving the linear program of Theorem 3.3 for small values of n. 
We include this proof as an illustration of a somewhat exotic phenomenon arising in the study of 
a simple HDE problem. 

Let P4„+2 = iV,E) where V = {0,1, . . . ,4n + 1} and E = {{0, 1}, {1, 2}, . . . , {4n, 4n 1}}. 
Define function / : V — > IN as follows: 

. /(0) = /(4n+l) = 2n+l, 

• /(4fc + 1) = fiik 3) = 2A; + 1 for A; G {0, . . . , n - 1}, 

• f{Ak + 2) = f{Ak + 4) = 2n - 2k - 1 for k e {0, . . . ,n - 1}. 

For every G W, we define a random graph Tjv = (Vm, Ej^) as follows. Let 

VN = {{v,i):v^V, i^{l,...,\Nf^^^^]]. 

Independently for all {v,i), {w,j) G Vat, place an edge with probability 

Pr [{{v,i),{w,j)}&EN] 

' if {v, w} = {4A;, 4k + 1} where k ^ {0, . . . ,n - I}, 

1 if {v, w} = {4k + r,4k + r + 1} where A; G {0, . . . , n - 1} and r G {1, 2, 3}, 
otherwise. 



It holds with high probability that 

hom(P4n+2,T^) ^ Ar4"^+3«+i-°(i). 
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1 1 



N 



N N'^ N N'^ N'-^ N"^ N'^ N N'^ N 

Figure 1: The random graph T]\j when n = 4 (drawn to logscale height). The value (1 or -^) 
in-between partitions of the vertex set indicates the probabihty of an edge. 



It also holds with high probability (by inspection of the various homomorphisms from P4 to P4n+2] 
that 



hom(P4,Tiv) ^ Ar4"+i+o(i). 



Therefore, 



HDE(P4,-P4n+2) ^ 



4n + 1 



4^2 + 3n + r 

We now prove the opposite inequality. We will represent homomorphisms P4 
4-tuples {ii,i2,h,H) € V^. Define a function w : Hom(P4, P4„+2) — > IN as follows: 

w((4/c,4/c + 1,4A:,4A: + 1)) = 1 for A; G {0, 

w{{Ak, Ak + l,4k + 2,Ak + l)) = 1 for k G {0, 

w{{4{n -k) + l, 4(n - A;), 4(n - k) - 1, 4(n - k))) = 1 for A; G {0, 

w{{Ak + 2, 4A: + 3, 4A; + 4, 4A; + 5)) = 4A; + 2 for k G {0, 

w{{4{n -k) + l, 4(n - k), 4(n - A;) - 1, 4(n - A;) - 2)) = 4A; + 2 for A; G {0, 



-P4n+2 by 



. ,n}, 

.,n-l}, 

.,n-l}, 

.,n-l}, 

.,n-l}, 



and let 'w{ip) = for all other homomorphisms ip G Hom(P4, P4n+2)- Note that 

^ w{(p) = An^ + 3n + l. 

V3eHom(P4,P4„+2) 

Fix any target graph T with at least one undirected edge. Let X G Dist((VT)^'') be the uniform 
distribution on Hom(G, T). Let $ be a random homomorphism in Hom(P, G) drawn according to 



4n? + 3n + l' 



Pr = (f] 

Let G Dist((yT)^^) denote the puUback of X along $ (so in particular Supp(F*) C Hom(P, T)). 
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Figure 2: The distribution <I> of homomorphisms P4 — > Pin+2 when n = 4. 

By a straightforward calculation using equation ([3]), we have 
(12) (4n2 + 3n + l)]Hy* 



]HX|o,i} - HXo) + (]HX|„,„+i| - MX4„+i) + J^(4n + 1)MX|4,, 4^ 



-1} 



n-l 

+ E 

fc=0 



/ (4n - 4/c)MX|4fc+i_4fc+2}' 

+ 4n]HIX{4fc+2_4fc+3} 

V+ (4A; + 4)]HX|4fc+3 ,4fc+4}/ 



fc=0 

/ (4n - 4/c)]HX4fc+i\ 

+ (4n - 4A; - l)]HX4fc+2 
+ {Ak + 3)MX4fc+3 

V+ (4A: + 4)11X4^+4/ 



By monotonicity and submodularity of the entropy operator (also using the fact that IHIX0 = 0), 
we have 



(13) 



^ < 



MX 



4n+l 



MX. 



{4?i,4n+l}! 



I]fc=o(4^ + 1) (^H-'^{4fc+l,4fc+2} - HX4fe+i - MX4fc+2j , 
Z]fc=0 HX{4fc+2,4fc+3} - - H-'^4fc+3, 

Efe=o(4n - 4A: - 3) (MX|4fc+3,4fc+4| - MX4fc+3 - MX4fe+4) • 



Adding each negative quantity in the lefthand side of equation ( 13 ) to the righthand side of equation 



(12), we get 



(4n2 + 3n + l)My* ^ (4n + 1) ^ IHIX|^^^} - ^ MX^ 

\{i',«)}eS ?;e{l,.--,4n} 

= (4n + l)MX by ([3]). 

It follows that HDE(P4, P4„+2) ^ -. 1 as required. 

4n + 1 

9 Conclusion 

The main open question is whether HDE(i^, G) is computable. (This question is equivalent to 



decidability of the homomorphism domination problem by virtue of Lemma |2.2[d).) Theorem 3.3 
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shows that HDE{F,G) is computable in the special case that F is chordal and G is series-parallel. 
Examples like HDE(Vee,(73) show that the homomorphism domination exponent can be tricky to 
compute even for very small instances. Our work also raises the problem of finding a closed-form 
expression for HDE(Pto,P„). So far, we only have closed expressions when m is odd or equal to 
2 or 4. Besides the applications in database theory, we hope that the homomorphism domination 
exponent will be seen as interesting parameter in its own right. 

Acknowledgements 

We thank Madhu Sudan, Noga Alon and Ehud Friedgut for insightful discussions. We also thank 
an anonymous referee for helpful comments. 

References 

[AI08I] Noga Alon. On the number of subgraphs of prescribed type of graphs with a given 
number of edges. Israel J. Math., 38(1-2):116-130, 1981. 

[BCL+06] Christian Borgs, Jennifer Chayes, Laszlo Lovasz, Vera T. Sos, and Katalin Vesztergombi. 

Counting graph homomorphisms. In Topics in Discrete Mathematics, volume 26 of 
Algorithms Combin., pages 315-371. Springer, Berlin, 2006. 

Ehud Friedgut and Jeff Kahn. On the number of copies of one hypergraph in another. 
Israel J. Math., 105:251-256, 1998. 

Ehud Friedgut. Hypergraphs, entropy, and inequalities. Amer. Math. Monthly, 
lll(9):749-760, 2004. 

David Calvin and Prasad Tetali. On weighted graph homomorphisms. In Graphs, 
morphisms and statistical physics, volume 63 of DIM ACS Ser. Discrete Math. Theoret. 
Comput. Sci., pages 97-104. Amer. Math. Soc, Providence, RI, 2004. 

Yannis E. loannidis and Raghu Ramakrishnan. Containment of conjunctive queries: Be- 
yond relations as sets. ACM Transactions on Database Systems, 20(3):288-324, Septem- 
ber 1995. 

T. S. Jayram, Phokion G. Kolaitis, and Erik Vee. The containment problem for REAL 
conjunctive queries with inequalities. In Stijn Vansummeren, editor, PODS, pages 80-89. 
ACM, 2006. 

Jeff Kahn. An entropy approach to the hard-core model on bipartite graphs. Combin. 
Probab. Comput, 10:219-237, 2001. 

T. Kovari, V.T. Sos, and P. Turan. On a problem of K. Zarankiewicz. Colloq. Math., 
3:50-57, 1954. 

A. Razborov. Flag algebras. J. Symbolic Logic, 72(4):1239-1282, 2007. 

I. Z. Ruzsa and E. Szemeredi. Triple systems with no six points carrying three triangles. 

In Combinatorics (Proc. Fifth Hungarian Colloq., Keszthely, 1976), Vol. II, volume 18 
of Colloq. Math. Soc. Jdnos Bolyai, pages 939-945. North-Holland, Amsterdam, 1978. 



22 



[RV06] Benjamin Rossman and Erik Vee. Counting homomorphisms from trees and cycles, 2006. 
Unpublished manuscript. 

[Sid91] Alexander Sidorenko. Inequalities for functionals generated by bipartite graphs. Diskret. 
Mat, 3:50-65, 1991. 

[Vee06] Erik Vee, 2006. Personal communication. 

[Yeu06] R. Yeung. A First Course in Information Theory. Springer, 2006. 



23 



